Method and apparatus for an improved bulk read socket call

ABSTRACT

An apparatus and method for an improved bulk read socket call are provided. With the apparatus and method, a new field, so_rcvlen, is added to the socket structure that identifies the bulk read size requested by the user. The kernel of the prior art recv( ) function is also modified so that it sets the so_rcvlen to the size requested by the user prior to the recv( ) function going to sleep and waiting for the full data size requested by the user. A new flag, SP_MSGWAITALL, is also provided in the socket structure. In the TCP input processing, when data is received for a particular socket, the current setting of the SP_MSGWAITALL is checked. If the SP_MSGWAITALL flag is set, it is determined whether the amount of data stored in the socket receive buffer is less than the value of so_rcvlen. If not, the TCP input processing does not wake up the recv( ) thread. However, for every alternate segment, the TCP input processing sends back an acknowledgment (ACK). In the TCP output processing, when the SP_MSGWAITALL flag is set and the amount of data in the socket receive buffer is less than so_rcvlen, the full window is advertised. Once the TCP input processing determines that there is at least an amount of data in the socket receive buffer equal to the value of so_rcvlen, the TCP input processing will wake up the recv( ) thread and the SP_MSGWAITALL flag is reset.

BACKGROUND OF THE INVENTION

1. Technical Field

The present invention is directed to a method and apparatus for animproved bulk read socket call.

2. Description of Related Art

The Internet has become a significant communication media in the modernworld and is enabling the world to migrate to one global datacommunications system. The Internet uses the Transmission ControlProtocol/Internet Protocol (TCP/IP) suite of protocols to provide acommon communications mechanism for computers, and other datatransmission devices, to communicate with one another.

Communication with applications running on a server is typicallyperformed using ports and addresses assigned to the application and theserver apparatus. A port may be a physical port or a logical port. Aphysical port is a pathway into and out of a computer or a networkdevice such as a switch or router. For example, the serial and parallelports on a personal computer are external sockets for plugging incommunications lines, modems and printers. Every network adapter has aport (Ethernet, Token Ring, etc.) for connection to the local areanetwork (LAN). Any device that transmits and receives data implies anavailable port to connect to each line.

A logical port is a number assigned to an application running on theserver by which the application can be identified. While a server mayhave a single physical port, the server may make use of a plurality oflogical ports. The combination of a logical port identifier and theaddress of the server apparatus is referred to as a socket.

The address of the server is a network address that identifies theserver in the network and how to route data to a particular physicalport of the server through the network. The address may take the form ofa Uniform Resource Locator (URL), or in the case of the Internet, anInternet Protocol (IP) address such as 205.15.01.01, or the like. Theaddress is included in headers of data packets transmitted by a device.The data packets are routed through the network from device to device byreading the header of the data packet and determining how to route thedata packet to its intended destination based on the address.

Once the data packet arrives at the intended destination serverapparatus, the server determines the destination application based onthe logical port identifier included in the header information of thedata packet. A data packet may be directed to a particular logical portby including the logical port identifier in its header information.

An application on a server “listens” to a logical port by retrievingdata having a logical port identifier that identifies the logical portassociated with that application. The application will take the datadirected to its logical port and place it in a queue for theapplication. In this way, data may be routed through a network to aserver apparatus and provided to a particular application on the serverapparatus for processing.

The TCP/IP protocol provides various socket functions that may be usedin the handling of data as the data is transported to and from anapplication through the socket. One such function that is typically usedby FTP file and print services is the recv(int sock, (void *) buffer,int flags) read function. This read function further has a known featureMSG_WAITALL that allows an application to read a large amount of data atone time as a bulk read instead of reading a large amount of data doingmultiple calls of the recv( ) function.

For example, assume that a user wishes to read data in bulk units of60,000 bytes. Using the MSG_WAITALL bulk read feature of the recv( )function, each time data is stored in the receive socket buffer, therecv( ) bulk read function is awakened. The recv( ) examines the receivesocket buffer to determine if 60,000 bytes are in the receive socketbuffer. If not, the recv( ) goes back to sleep and waits until anotheramount of data is stored in the receive socket buffer when it will againbe awakened. If there is 60,000 bytes in the receive socket buffer, thisamount of data is read from the receive socket buffer and provided tothe calling application.

With this bulk read function, a single call to the recv( ) function ismade rather than multiple calls and thus, the overhead associated withthe extra system call execution is avoided. However, this feature, whenused with TCP sessions has many limitations.

First, even though the recv( ) function waits for the full amount ofdata the user has requested, the TCP wakes up the blocked thread, i.e.the thread from the calling application that calls recv( ), each time adata segment arrives. The thread wakes up and checks if the full datahas arrived. If not, it goes back to sleep again. For a 64 Kb recv( )function call, for example, receiving 1460 byte Maximum Segment Size(MSS) TCP segments, this would result in approximately 43 unnecessarywakeups of the thread and the associated overhead.

Second, when MSG_WAITALL is used, TCP acknowledgments are delayed up to200 milliseconds. The reason for this is that acknowledgments aretriggered by the application reading at least 2 MSS worth of data fromthe receive socket buffer. However, when the MSG_WAITALL feature isused, since the data remains in the receive socket buffer until the userrequested amount of data is gathered in the buffer, acknowledgments willnot be sent until the delayed acknowledgment timer expires. The delayedacknowledgment timer is a timer that delays the sending of anacknowledgment of receipt of data up to 200 milliseconds in anticipationof sending the acknowledgment with data that needs to be sent in thereverse direction. Delaying the acknowledgments so much causes a numberof problems.

For example, the TCP's congestion control and avoidance schemes, such asslow start and the like, depend heavily on incoming acknowledgments.Slow start, for example, is the phase of data transmission in which thesending computing device sends data slowly and increases the data flowon the arrival of each acknowledgment from the receiving computingdevice.

Thus, for example, if the acknowledgments are delayed, during the slowstart phase, the congestion window will open up very slowly, i.e. thedata flow will increase very slowly. The congestion window is ameasurement of the amount of data that a sender may send to a receivingcomputing device and avoid causing congestion in the data network.

In addition, the fast recovery mechanism after detecting packet lossrecovers solely depending on the arrival of acknowledgments. Forexample, assume that a receiver is waiting for 32 Kb of data on a recv() function call with MSG_WAITALL set and a sender's congestion window iscurrently 22 segments, i.e. TCP packets, of 1460 bytes. If a data packetgets dropped, the fast retransmit algorithm retransmits the droppedsegment after receiving three duplicate acknowledgments but also halvesthe congestion window to 11 segments to thereby slow down the datatraffic in the network.

Assuming, for example, a single packet drop, the receiver wouldacknowledge all 22 segments on receiving the dropped segment and therecv( ) function should complete. However, for the next “send”, thesender is allowed to send only 11 segments whereas the receiver iswaiting for the full 32 Kb. There will now be a pause until the 200millisecond delayed acknowledgment timer expires. Then the TCP wouldacknowledge the 11 segments. Now the sender can send the next 11segments. On receiving the next 11 segments, the recv( ) function wouldalso complete. However, since the sender is now in a fast recover phase,the congestion window opens up by only 1/11th of the segment size peracknowledgment. Therefore, for the next recv( ) call, the same 200millisecond delay occurs. This continues until the congestion windowgrows back to 22 segments. The above example considers only one segmentloss for this duration. The situation is considerably worse whenmultiple packet drops occur.

Third, when the MSG_WAITALL flag is used, since the receiver'sadvertised window, i.e. the size of the receivers TCP buffer, keepsreducing until the recv( ) function gets the full data requested by theuser, a situation may occur where the sender hits the persist timerdelays (minimum of 5 seconds in most implementations). The persist timerdelays are the delays perceived by the sending computing system due toprobing of the receiving computing system to determine when thereceiving computing system TCP buffer is no longer full. This is causeddue to the fact that TCP is byte oriented and not message oriented.

A 32 Kb message written by the sender gets packetized and depacketizedby TCP in a manner determined by TCP. When the receiver side windowreduces to a value less than the MSS used by the connection, the senderdefers sending this small amount of data (if it has enough data queuedin its send buffer) until the receiver opens up its window because thesender thinks the receiver is busy and is slow to read the data. Thismay not be the case, however, because the receiver might actually bewaiting on the recv( ) function for just this small piece of data tomake the 32 Kb that the user has requested.

Thus, it would be beneficial to have a method and apparatus for animproved bulk read socket call that avoids the drawbacks of the priorart outlined above.

SUMMARY OF THE INVENTION

The present invention provides an apparatus and method for an improvedbulk read socket call are provided. With the apparatus and method of thepresent invention, a new field, so_rcvlen, is added to the socketstructure that identifies the bulk read size requested by the user. Thekernel of the prior art recv( ) function is also modified so that itsets the so_rcvlen to the size requested by the user prior to the recv() function going to sleep and waiting for the full data size requestedby the user. In addition, a new flag, SP_MSGWAITALL, is also provided inthe socket structure.

In the TCP input processing of the present invention, when data isreceived for a particular socket, the current setting of theSP_MSGWAITALL is checked. If the SP_MSGWAITALL flag is set, it isdetermined whether the amount of data stored in the socket receivebuffer is less than the value of so_rcvlen. If not, the TCP inputprocessing does not wake up the recv( ) thread. However, for everyalternate segment, the TCP input processing sends back an acknowledgment(ACK).

In the TCP output processing of the present invention, when theSP_MSGWAITALL flag is set and the amount of data in the socket receivebuffer is less than so_rcvlen, the full window is advertised. Once theTCP input processing determines that there is at least an amount of datain the socket receive buffer equal to the value of so_rcvlen, the TCPinput processing will wake up the recv( ) thread and the SP_MSGWAITALLflag is reset.

BRIEF DESCRIPTION OF THE DRAWINGS

The novel features believed characteristic of the invention are setforth in the appended claims. The invention itself, however, as well asa preferred mode of use, further objectives and advantages thereof, willbest be understood by reference to the following detailed description ofan illustrative embodiment when read in conjunction with theaccompanying drawings, wherein:

FIG. 1 depicts a pictorial representation of a network of dataprocessing systems in which the present invention may be implemented;

FIG. 2 is a block diagram of a data processing system that may beimplemented as a server in accordance with a preferred embodiment of thepresent invention;

FIG. 3 is a block diagram illustrating a data processing system in whichthe present invention may be implemented; and

FIG. 4 is a flowchart outlining an exemplary operation of the presentinvention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

With reference now to the figures, FIG. 1 depicts a pictorialrepresentation of a network of data processing systems in which thepresent invention may be implemented. Network data processing system 100is a network of computers in which the present invention may beimplemented. Network data processing system 100 contains a network 102,which is the medium used to provide communications links between variousdevices and computers connected together within network data processingsystem 100. Network 102 may include connections, such as wire, wirelesscommunication links, or fiber optic cables.

In the depicted example, a server 104 is connected to network 102 alongwith storage unit 106. In addition, clients 108, 110, and 112 also areconnected to network 102. These clients 108, 110, and 112 may be, forexample, personal computers or network computers. In the depictedexample, server 104 provides data, such as boot files, operating systemimages, and applications to clients 108–112. Clients 108, 110, and 112are clients to server 104. Network data processing system 100 mayinclude additional servers, clients, and other devices not shown. In thedepicted example, network data processing system 100 is the Internetwith network 102 representing a worldwide collection of networks andgateways that use the TCP/IP suite of protocols to communicate with oneanother. At the heart of the Internet is a backbone of high-speed datacommunication lines between major nodes or host computers, consisting ofthousands of commercial, government, educational and other computersystems that route data and messages. Of course, network data processingsystem 100 also may be implemented as a number of different types ofnetworks, such as for example, an intranet, a local area network (LAN),or a wide area network (WAN). FIG. 1 is intended as an example, and notas an architectural limitation for the present invention.

Referring to FIG. 2, a block diagram of a data processing system thatmay be implemented as a server, such as servers 104, 114, 118 in FIG.1B, is depicted in accordance with a preferred embodiment of the presentinvention. Data processing system 200 may be a symmetric multiprocessor(SMP) system including a plurality of processors 202 and 204 connectedto system bus 206. Alternatively, a single processor system may beemployed. Also connected to system bus 206 is memory controller/cache208, which provides an interface to local memory 209. I/O bus bridge 210is connected to system bus 206 and provides an interface to I/O bus 212.Memory controller/cache 208 and I/O bus bridge 210 may be integrated asdepicted.

Peripheral component interconnect (PCI) bus bridge 214 connected to I/Obus 212 provides an interface to PCI local bus 216. A number of modemsmay be connected to PCI bus 216. Typical PCI bus implementations willsupport four PCI expansion slots or add-in connectors. Communicationslinks to network computers 108–112 in FIG. 1B may be provided throughmodem 218 and network adapter 220 connected to PCI local bus 216 throughadd-in boards.

Additional PCI bus bridges 222 and 224 provide interfaces for additionalPCI buses 226 and 228, from which additional modems or network adaptersmay be supported. In this manner, data processing system 200 allowsconnections to multiple network computers. A memory-mapped graphicsadapter 230 and hard disk 232 may also be connected to I/O bus 212 asdepicted, either directly or indirectly.

Those of ordinary skill in the art will appreciate that the hardwaredepicted in FIG. 2 may vary. For example, other peripheral devices, suchas optical disk drives and the like, also may be used in addition to orin place of the hardware depicted. The depicted example is not meant toimply architectural limitations with respect to the present invention.

The data processing system depicted in FIG. 2 may be, for example, anIBM RISC/System 6000 system, a product of International BusinessMachines Corporation in Armonk, N.Y., running the Advanced InteractiveExecutive (AIX) operating system.

With reference now to FIG. 3, a block diagram illustrating a dataprocessing system is depicted in which the present invention may beimplemented. Data processing system 300 is an example of a clientcomputer. Data processing system 300 employs a peripheral componentinterconnect (PCI) local bus architecture. Although the depicted exampleemploys a PCI bus, other bus architectures such as Accelerated GraphicsPort (AGP) and Industry Standard Architecture (ISA) may be used.Processor 302 and main memory 304 are connected to PCI local bus 306through PCI bridge 308. PCI bridge 308 also may include an integratedmemory controller and cache memory for processor 302. Additionalconnections to PCI local bus 306 may be made through direct componentinterconnection or through add-in boards. In the depicted example, localarea network (LAN) adapter 310, SCSI host bus adapter 312, and expansionbus interface 314 are connected to PCI local bus 306 by direct componentconnection. In contrast, audio adapter 316, graphics adapter 318, andaudio/video adapter 319 are connected to PCI local bus 306 by add-inboards inserted into expansion slots. Expansion bus interface 314provides a connection for a keyboard and mouse adapter 320, modem 322,and additional memory 324. Small computer system interface (SCSI) hostbus adapter 312 provides a connection for hard disk drive 326, tapedrive 328, and CD-ROM drive 330. Typical PCI local bus implementationswill support three or four PCI expansion slots or add-in connectors.

An operating system runs on processor 302 and is used to coordinate andprovide control of various components within data processing system 300in FIG. 3. The operating system may be a commercially availableoperating system, such as Windows 2000, which is available fromMicrosoft Corporation. An object oriented programming system such asJava may run in conjunction with the operating system and provide callsto the operating system from Java programs or applications executing ondata processing system 300. “Java” is a trademark of Sun Microsystems,Inc. Instructions for the operating system, the object-orientedoperating system, and applications or programs are located on storagedevices, such as hard disk drive 326, and may be loaded into main memory304 for execution by processor 302.

Those of ordinary skill in the art will appreciate that the hardware inFIG. 3 may vary depending on the implementation. Other internal hardwareor peripheral devices, such as flash ROM (or equivalent nonvolatilememory) or optical disk drives and the like, may be used in addition toor in place of the hardware depicted in FIG. 3. Also, the processes ofthe present invention may be applied to a multiprocessor data processingsystem.

As another example, data processing system 300 may be a stand-alonesystem configured to be bootable without relying on some type of networkcommunication interface, whether or not data processing system 300comprises some type of network communication interface. As a furtherexample, data processing system 300 may be a Personal Digital Assistant(PDA) device, which is configured with ROM and/or flash ROM in order toprovide non-volatile memory for storing operating system files and/oruser-generated data.

The depicted example in FIG. 3 and above-described examples are notmeant to imply architectural limitations. For example, data processingsystem 300 also may be a notebook computer or hand held computer inaddition to taking the form of a PDA. Data processing system 300 alsomay be a kiosk or a Web appliance.

The present invention provides a mechanism in the application layer andTCP layer of the standard TCP/IP stack that moves the monitoringfunction of the socket receive buffer from the recv( ) function to theTCP processing. In moving the monitoring function to the TCP processing,much of the overhead associated with waking up the recv( ) thread andputting it back to sleep in the prior art is eliminated. In addition,the problems with regard to not sending out acknowledgment messages arealso overcome by the present invention.

With the present invention, a new field, so_rcvlen is added to thesocket structure, i.e. the kernel structure per socket that contains thesocket send and receive buffer queues, points to the communicationprotocol (TCP, UDP, UNIX, etc.) private data, and contains state anderror information. This new field identifies the bulk read sizerequested by the user. The kernel of the recv( ) function, i.e. thefundamental part of a the recv( ) function that resides in memory at alltimes, is also modified so that it sets the so_rcvlen to the sizerequested by the user prior to the recv( ) function going to sleep andwaiting for the full data size requested by the user.

In addition, a new flag, SP_MSGWAITALL, is provided in the field of thesocket structure that stores flags. The SP_MSGWAITALL flag is used toidentify when the functions of the present invention are to beimplemented, as described hereafter. Thus, the application layer of thepresent invention is modified from the application layer of the priorart by providing a new field in the socket structure for identifying therequested bulk read size and by providing a new flag for identifyingwhen the functions of the present invention are to be implemented.

With regard to the TCP layer, the present invention modifies the TCPlayer of the prior art by providing additional functionality forperforming functions based on the settings of the so_rcvlen,SP_MSGWAITALL, and the amount of data stored in the socket receivebuffer, as detailed hereafter.

At different TCP states, TCP is required (under RFC 793), to handleincoming segments differently. TCP state transitions occur based onevents such as incoming TCP segments. TCP input processing is the TCPcode that is implemented to handle a TCP segment that arrives at a TCPenabled device. Some of the different tasks performed by TCP inputprocessing include validating an incoming segment for data corruption,determining to which socket, i.e. which user application, this segmentbelongs, delivering data to the user application, performing statetransitions (e.g., when the incoming segment indicates the connectionneeds to be closed), and the like.

With the present invention, in the TCP input processing, the followingnew functions are provided. When data is received for a particularsocket, the current setting of the SP_MSGWAITALL is checked. If theSP_MSGWAITALL flag is set on the socket, a determination is made as towhether the amount of data stored in the socket receive buffer is lessthan the value of so_rcvlen. If the amount of data stored in the socketreceive buffer is less than the value of so_rcvlen, the TCP inputprocessing does not wake up the recv( ) thread. However, for everyalternate segment, the TCP input processing sends back an acknowledgment(ACK).

Thus, the overhead of waking up the recv( ) thread and putting it backto sleep again, because there is not enough data in the receive socketbuffer, is avoided. Additionally, the present invention sends outacknowledgments for every alternate segment and thus, the problemsoutlined above with regard to the prior art not sending outacknowledgments are avoided.

In the TCP output processing, the following new functions are provided.When the SP_MSGWAITALL flag is set in the socket and the amount of datain the socket receive buffer is less than so_rcvlen, the full window isadvertised. The full window is the window size advertised by thereceiving computing device to the sending computing device at connectionsetup time. By advertising the full window, the receiving computingdevice informs the sending computing device of the available socketbuffer space. This size then dynamically changes depending on how muchspace is available at any one time in the socket buffer. The spaceavailable is the original, or full window, size minus the amount of dataqueued in the socket buffer.

By advertising the full window in this manner, the present inventionemulates the situation where the data stored in the socket receivebuffer has been read by the application. This is only done when theSP_MSGWAITALL flag is set which guarantees that the recv( ) thread isblocked waiting for a sufficient amount of data. When the application isnot looking at this socket at all, such as when the application is busydoing something else or when the system is busy, the operations of theprior art are implemented, i.e. the window is reduced as the bufferfills up.

Once the TCP input processing determines that there is at least anamount of data in the socket receive buffer equal to the value ofso_rcvlen, the TCP input processing will wake up the recv( ) thread. Therecv( ) thread is modified to reset the SP_MSGWAITALL flag and copy anamount of data equal to so_rcvlen in the socket receive buffer to theapplication's buffer. The next time the user calls recv( ) withMSG_WAITALL, if there is partial data stored in the socket receivebuffer, a window update is sent out to the sending computing device.This will let the sender resume sending in case it is waiting for thewindow to be opened, i.e. Waiting for the receiver to inform the senderthat there is space available in the receive socket buffer.

FIG. 4 is a flowchart outlining an exemplary operation of the presentinvention. The functions described in FIG. 4 may be performed insoftware, hardware, or a combination of software and hardware. In apreferred embodiment, the functions of the present invention detailed inFIG. 4 are performed as software instructions executed on a processor,such as those shown in FIGS. 2 and 3.

As shown in FIG. 4, the operation starts with invoking the recv( )function (step 410). The value for so_rcvlen is set (step 420). Adetermination is then made as to whether there is an amount of datastored in the socket receive buffer at least equal to the value ofso_rcvlen (step 430). If so, an amount of data equal to so_rcvlen iscopied to the application buffer (step 500).

If there is not at least an amount of data equal to so_rcvlen in thesocket receive buffer, the SP_MSGWAITALL flag is set (step 440). Theoperation then waits for a data segment to be received in the socketreceive buffer (step 450). A determination is made as to whether a datasegment is received in the socket receive buffer (step 460). If not, theoperation returns to step 450 and continues to wait for a data segment.

Upon receiving a data segment in the socket receive buffer, adetermination is made as to whether there is at least an amount of datain the socket receive buffer equal to so_rcvlen (step 470). If so, thenthe recv( ) thread is awaken (step 480) and the SP_MSGWAITALL flag isreset (step 490). An amount of data equal to so_rcvlen is then copiedfrom the socket receive buffer to the application buffer, i.e. theapplication that called the recv( ) function (step 500).

If there is not at least an amount of data equal to so_rcvlen in thesocket receive buffer, the modified TCP input and output processing areperformed (step 510). This includes not waking up the recv( ) thread andsending out acknowledgments for every alternate segment received. Inaddition, the full window is advertised to the sender of the datasegment. The operation then returns to step 450 and waits for the nextdata segment to arrive at the socket receive buffer.

Thus, the present invention provides a mechanism for performing bulkread operations using a socket receive buffer that avoids the problemsassociated with the prior art. The present invention moves themonitoring functions from the application layer recv( ) function to theTCP layer and thus, eliminates the need to repeatedly awaken and putback to sleep the recv( ) function. This provides a considerable savingswith regard to processing cycles due to the reduction in overhead. Inaddition, the present invention provides acknowledgment messages forevery alternate segment and thus, the problems associated with thenon-transmission of acknowledgment messages in the prior art areavoided. Moreover, the full window is advertised to senders of datasegments so that the delay associate with the prior art is avoided.

It is important to note that while the present invention has beendescribed in the context of a fully functioning data processing system,those of ordinary skill in the art will appreciate that the processes ofthe present invention are capable of being distributed in the form of acomputer readable medium of instructions and a variety of forms and thatthe present invention applies equally regardless of the particular typeof signal bearing media actually used to carry out the distribution.Examples of computer readable media include recordable-type media such afloppy disc, a hard disk drive, a RAM, and CD-ROMs and transmission-typemedia such as digital and analog communications links.

The description of the present invention has been presented for purposesof illustration and description, but is not intended to be exhaustive orlimited to the invention in the form disclosed. Many modifications andvariations will be apparent to those of ordinary skill in the art. Theembodiment was chosen and described in order to best explain theprinciples of the invention, the practical application, and to enableothers of ordinary skill in the art to understand the invention forvarious embodiments with various modifications as are suited to theparticular use contemplated.

1. A method of performing a bulk read, comprising: creating a socketstructure for a socket associated with a logical port, wherein thesocket structure contains a socket receive buffer; initiating, by a userassociated with the logical port, a bulk read function having a bulkread size; storing the bulk read size in a field in the socketstructure; checkine state of the flag in the socket structure;determining if an amount of data stored in the socket receive buffer isequal to or greater than the balk read size, if the flag is set;activating the bulk read function only when there is an amount of datain the socket receive buffer equal to or greater than the bulk read sizein response to setting of the flag in the socket structure; andinforming a sending device that a full window size of data is availablein the socket receive buffer, if the flag is set and the amount of datastored in the socket receive buffer is less than the bulk read size. 2.The method of claim 1, wherein the user is an application associatedwith the logical port.
 3. The method of claim 1, wherein the bulk readsize is a size identified by the user.
 4. The method of claim 1, whereinif the amount of data stored in the socket receive buffer is less thanthe bulk read size, the bulk read function is not activated.
 5. Themethod of claim 4, further comprising: receiving a Transport ControlProtocol (TCP) segment from a sending device, wherein the steps ofchecking the state of the flag in the socket structure and determiningif an amount of data in the socket receive buffer is less than the bulkread size are performed in response to receiving the TCP segment.
 6. Themethod of claim 5, wherein if the amount of data stored in the socketreceive buffer is less than the bulk read size, an acknowledgment issent to the sending device for every alternate TCP segment received. 7.The method of claim 1, wherein the steps of checking a state of theflag, determining if an amount of data stored in the socket receivebuffer is less than the bulk read size, and activating the bulk readfunction are performed in a Transport Control Protocol (TCP) layer. 8.The method of claim 1, wherein activating the bulk read functionincludes: copying an amount of data equal to the bulk read size from thesocket receive buffer to an application buffer; and resetting the flag.9. The method of claim 8, wherein activating the bulk read functionfurther includes: determining if there is data stored in the socketreceive buffer after copying the amount of data equal to the bulk readsize from the socket receive buffer to the application buffer; andsending a window update to a sending device if there is data stored inthe socket receive buffer after the copying.
 10. An apparatus forperforming a bulk read, comprising: means for creating a socketstructure for a socket associated with a logical port, wherein thesocket structure contains a socket receive buffer; means for initiating,by a user associated with the logical port, a bulk read function havinga bulk read size; means for storing the bulk read size in a field in thesocket structure; means for checking a state of the flag in the socketstructure; means for determining if an amount of data stored in thesocket receive buffer is equal to or greater than the bulk read size, ifthe flag is set; means for activating the bulk read function only whenthere is an amount of data in the socket receive buffer equal to orgreater than the bulk read size in response to setting of the flag inthe socket structure; and means for informing a sending device that afall window size of data is available in the socket receive buffet, ifthe flag is set and the amount of data stored in the socket receiverbuffer is less than bulk read size.
 11. The apparatus of claim 10,wherein the user is an application associated with the logical port. 12.The apparatus of claim 10, wherein the bulk read size is a sizeidentified by the user.
 13. The apparatus of claim 10, wherein if theamount of data stored in the socket receive buffer is less than the bulkread size, the bulk read function is not activated by the means foractivating the bulk read function.
 14. The apparatus of claim 13,further comprising: means for receiving a Transport Control Protocol(TCP) segment from a sending device, wherein the means for checking thestate of the flag in the socket structure and means for determining ifan amount of data in the socket receive buffer is less than the bulkread size operate in response to receiving the TCP segment.
 15. Theapparatus of claim 14, further comprising a means far sending anacknowledgment to the sending device for every alternate TCP segmentreceived, if the amount of data stored in the socket receive buffer isless than the bulk read size.
 16. The method of claim 10, wherein themeans for checking a state of the flag, means for determining if anamount of data stored in the socket receive buffer is less than to bulkread size, and means for activating the bulk read function are part of aTransport Control Protocol (TCP) layer.
 17. The apparatus of claim 10,wherein the means for activating the bulk read function include: meansfor copying an amount of data equal to the bulk read size from thesocket receive buffer to an application buffer; and means for resettingto flag.
 18. The apparatus of claim 17, wherein the means for activatingthe bulk read function further includes: means for determining if thereis data stored in the socket receive buffer after copying the amount ofdata equal to the bulk read size from the socket receive buffer to theapplication buffer; and means for sending a window update to a sendingdevice if there is data stored in the socket receive buffer alter thecopying.
 19. A computer program product in a computer readable mediumfor performing a bulk read, comprising: instructions for creating asocket structure for a socket associated with a logical port, whereinthe socket structure contains a socket receive buffer; instructions forinitiating, by a user associated with the logical port, a bulk readfunction having a bulk read size; instructions for storing the bulk readsize in a field in the socket structure; instructions for checking astate of the flag in the socket structure; instructions for determiningif an amount of data stored in the socket receive buffer is equal to orgreater than the bulk read size, if the flag is set; instructions foractivating to bulk read function only when there is an amount of data inthe socket receive buffer equal to or greater than the bulk read size inresponse to setting of the flag in the socket structure; andinstructions for informing a sending device that a full window size ofdata is available in the socket receive buffer, if the flag is set andthe amount of data stored in the socket receive buffer is less than thebulk read size.
 20. The computer program product of claim 10, whereinthe user is an application associated wit the logical port.
 21. Thecomputer program product of claim 19, wherein the bulk read size is asize identified by the user.
 22. The computer program product of claim19, wherein if the amount of data stored in the socket receive buffer isless than the bulk read size, the instructions are not executed.
 23. Thecomputer program product of claim 22, further comprising: instructionsfor receiving a Transport Control Protocol (TCP) segment from a sendingdevice, wherein the fourth instructions for checking the state of theflag in the socket structure and fifth instructions for determining ifan amount of data in the socket receive buffer is less than the bulkread size are executed in response to receiving the TCP segment.
 24. Thecomputer program product of claim 23, further comprising instructionsfor sending an acknowledgment to the sending device for every alternateTCP segment received, if the amount of data stored in the socket receivebuffer is less than the bulk read size.
 25. The computer program productof claim 19, wherein the instructions for checking a state of the flag,instructions for determining if an amount of data stored in the socketreceive buffer is less than the bulk read size, and instructions foractivating the bulk read function are executed in a Transport ControlProtocol (TCP) layer.
 26. The computer program product of claim 19,wherein the instructions far activating the bulk read function include:instructions for copying an amount of data equal to the bulk read sizefrom the socket receive buffer to an application buffer; andinstructions far resetting the flag.
 27. The computerprogram product ofclaim 26, wherein the instructions for activating the bulk read functionfurther include: instructions for determining if there is data stored inthe socket receive buffer after copying the amount of data equal to thebulk read size from the socket receive buffer to the application buffer,and instructions for sending a window update to a sending device ifthere is data stored in the socket receive buffer after the copying. 28.The method of claim 1, further comprising placing the bulk read functionin an inactive state if an amount of data in the socket receive bufferis not equal to or greater than the bulk read size.
 29. The apparatus ofclaim 10, further comprising means for placing the bulk read function inan inactive state if an amount of data in the socket receive buffer isnot equal to or greater than the bulk read size.
 30. The computerprogram product of claim 19, further comprising instructions for placingthe bulk read function in an inactive state if an amount of data in thesocket receive buffer is not equal to or greater than the bulk readsize.